Substitutional Analysis of Orthologous Protein Families Using BLOCKS
نویسندگان
چکیده
Orthologous proteins, form due to divergence of parental sequence, perform similar function under different environmental and biological conditions. Amino acid changes at locus specific positions form hetero-pairs whose role in BLOCK evolution is yet to be understood. We involve eight protein BLOCKs of known divergence rate to gain insight into the role of hetero-pairs in evolution. Our procedure APBEST uses BLOCK-FASTA file to extract BLOCK specific evolutionary parameters such as dominantly used hetero-pair (D), usage of hetero-pairs (E), non-conservative to conservative substitution ratio (R), maximally-diverse residue (MDR), residue (RD) and class (CD) specific diversity. All these parameters show BLOCK specific variation. Conservative nature of D points towards restoration of function of BLOCK. While E sets the upper-limit of usage of hereto-pairs, strong correlation of R with divergence-rate indicates that the later is directly dependent on non-conservative substitutions. The observation that MDR, measure of positional diversity, occupy very limited positions in BLOCK indicates accommodation of diversity is positionally restricted. Overall, the study extract observed hetero-pair related quantitative and multi-parametric details of BLOCK, which finds application in evolutionary biology.
منابع مشابه
Using shared genomic synteny and shared protein functions to enhance the identification of orthologous gene pairs
MOTIVATION The identification of orthologous gene pairs is generally based on sequence similarity. Gene pairs that are mutually 'best hits' between the genomes being compared are asserted to be orthologs. Although this method identifies most orthologous gene pairs with high confidence, it will miss a fraction of them, especially genes in duplicated gene families. In addition, the approach depen...
متن کاملComprehensive analysis of orthologous protein domains using the HOPS database.
One of the most reliable methods for protein function annotation is to transfer experimentally known functions from orthologous proteins in other organisms. Most methods for identifying orthologs operate on a subset of organisms with a completely sequenced genome, and treat proteins as single-domain units. However, it is well known that proteins are often made up of several independent domains,...
متن کاملGreenPhylDB: a database for plant comparative genomics
GreenPhylDB (http://greenphyl.cirad.fr) is a comprehensive platform designed to facilitate comparative functional genomics in Oryza sativa and Arabidopsis thaliana genomes. The main functions of GreenPhylDB are to assign O. sativa and A. thaliana sequences to gene families using a semi-automatic clustering procedure and to create 'orthologous' groups using a phylogenomic approach. To date, Gree...
متن کاملFast and simple protein-alignment-guided assembly of orthologous gene families from microbiome sequencing reads
BACKGROUND Microbiome sequencing projects typically collect tens of millions of short reads per sample. Depending on the goals of the project, the short reads can either be subjected to direct sequence analysis or be assembled into longer contigs. The assembly of whole genomes from metagenomic sequencing reads is a very difficult problem. However, for some questions, only specific genes of inte...
متن کاملBlocks+: a non-redundant database of protein alignment blocks derived from multiple compilations
MOTIVATION As databanks grow, sequence classification and prediction of function by searching protein family databases becomes increasingly valuable. The original Blocks Database, which contains ungapped multiple alignments for families documented in Prosite, can be searched to classify new sequences. However, Prosite is incomplete, and families from other databases are now available to expand ...
متن کامل